Abstract: Clustering plays an important role in data mining, pattern recognition, and machine learning.
Single-valued neutrosophic sets (SVNSs) are a useful means of describing and handling indeterminate and
inconsistent information that fuzzy sets and intuitionistic fuzzy sets cannot describe or deal with. To cluster
data represented by single-valued neutrosophic information, this article proposes single-valued neutrosophic
clustering methods based on similarity measures between SVNSs. First, we define a generalized distance measure
between SVNSs and propose two distance-based similarity measures of SVNSs. Then, we present a clustering
algorithm based on the similarity measures of SVNSs to cluster single-valued neutrosophic data. Finally, an
illustrative example is given to demonstrate the application and effectiveness of the developed clustering methods.
1 Introduction

Clustering plays an important role in data mining, pattern recognition, information retrieval, microbiology
analysis, and machine learning. Clustering data sets into disjoint groups is a problem arising in many
domains. Generally, the goal of clustering is to find groups that are both homogeneous and well separated;
that is, entities within the same group should be similar and entities in different groups dissimilar.
Because of the fuzziness and uncertainty of many practical problems in the real world, Zadeh [12] first
proposed the theory of fuzzy sets, which has achieved great success in various fields. Fuzzy clustering
analysis is a fundamental and important tool in fuzzy data analysis, and Ruspini [3] first presented the
concept of fuzzy division and a fuzzy clustering approach. Later, the intuitionistic fuzzy set (IFS) introduced
by Atanassov [1] was found to be highly useful in dealing with vagueness. The concept of IFSs is a
generalization of that of fuzzy sets. IFSs consider three aspects of information: membership, non-membership,
and hesitancy. Therefore, they are much more flexible and practical than traditional fuzzy sets in dealing with
vagueness and uncertainty. Hence, Zhang et al. [13] and Xu et al. [7] proposed clustering algorithms for IFSs
based on association coefficients and similarity measures of IFSs, and then extended the algorithms to cluster
interval-valued IFSs (IVIFSs), which were proposed by Atanassov and Gargov [2]. However, in the above clustering
techniques, fuzzy sets, IFSs, and IVIFSs cannot describe and deal with the indeterminate and inconsistent
information that exists in the real world. To represent uncertain, imprecise, incomplete, and inconsistent
information, Smarandache [4] gave the concept of a neutrosophic set from a philosophical point of view. The
neutrosophic set is a powerful general formal framework that generalizes the concepts of the classic set, fuzzy
set, interval-valued fuzzy set, IFS, IVIFS, paraconsistent set, dialetheist set, paradoxist set, and
tautological set [4]. In the neutrosophic set, truth-membership, indeterminacy-membership, and falsity-membership
are represented independently. However, the neutrosophic set generalizes the above-mentioned sets from a
philosophical point of view, and its functions $T_A(x)$, $I_A(x)$, and $F_A(x)$ are real standard or non-standard
subsets of $]^-0, 1^+[$, i.e., $T_A(x): X \to ]^-0, 1^+[$, $I_A(x): X \to ]^-0, 1^+[$, and
$F_A(x): X \to ]^-0, 1^+[$, and there is no restriction on the sum of $T_A(x)$, $I_A(x)$, and $F_A(x)$, i.e.,
$^-0 \le \sup T_A(x) + \sup I_A(x) + \sup F_A(x) \le 3^+$. Thus, it is difficult to apply
in real scientific and engineering areas [6]. For this reason, Wang et al. [6] introduced the single-valued
neutrosophic set (SVNS), which is an instance of a neutrosophic set. It can describe and handle indeterminate
information and inconsistent information.

The IFS contains both the truth-membership $t_A(x)$ and the falsity-membership $f_A(x)$ with
$t_A(x), f_A(x) \in [0, 1]$ and $0 \le t_A(x) + f_A(x) \le 1$, and can only handle incomplete information (a set
incompletely known); it cannot handle the indeterminate information that is the zone of ignorance of a
proposition’s value between truth and falsehood (inconsistent information). The indeterminacy in an IFS is
$1 - t_A(x) - f_A(x)$ (i.e., hesitancy or unknown degree) by default, while the indeterminacy in a neutrosophic
set is quantified explicitly, and the indeterminacy component $I(x)$ can be split into more subcomponents in
order to better capture the vague information in the real world [4]. Moreover, the truth-membership, the
indeterminacy-membership, and the falsity-membership are independently represented in the neutrosophic set. Its
components $T(x)$, $I(x)$, $F(x)$ are non-standard subsets included in the unitary non-standard interval
$]^-0, 1^+[$ or standard subsets included in the unitary standard interval [0, 1] as in the IFS. Furthermore, the
connectors in the IFS are defined only by $T(x)$ and $F(x)$ (i.e., truth-membership and falsity-membership);
hence, the indeterminacy $I(x)$ is what is left from 1, while in the neutrosophic set, they can be defined by
any of them (no restriction) [4]. For example, when we ask the opinion of an expert about a certain statement,
he/she may say that the possibility that the statement is true is 0.6, the possibility that it is false is 0.5,
and the degree to which he/she is not sure is 0.2. In neutrosophic notation, this can be expressed as
x(0.6, 0.2, 0.5). As another example, suppose there are 10 voters during a voting process. Five vote “aye,” two
vote “blackball,” and three are undecided. In neutrosophic notation, this can be expressed as x(0.5, 0.3, 0.2).
These expressions are beyond the scope of the IFS. Therefore, the notion of a neutrosophic set is more general
and overcomes the aforementioned issues.

Recently, Ye [8, 9] presented the correlation coefficient of SVNSs and the cross-entropy measure of SVNSs 
and applied them to single-valued neutrosophic decision-making problems. Wang et al. [5] proposed the 
theory and application of interval neutrosophic sets. Then, Ye [11] proposed similarity measures between 
interval neutrosophic sets and their applications in multicriteria decision making. Furthermore, Ye [10] 
introduced the concept of simplified neutrosophic sets and simplified neutrosophic weighted aggregation 
operators, and then applied them to multicriteria decision-making problems under a simplified neutrosophic 
environment. 

Until now, however, there have been no studies on the clustering of data represented by single-valued
neutrosophic information, and the existing clustering algorithms cannot cluster single-valued neutrosophic
data. Motivated by the intuitionistic fuzzy clustering algorithms [7, 13], this article proposes a single-valued
neutrosophic clustering algorithm to deal with data represented by SVNSs. The rest of the article is
organized as follows. Section 2 introduces some basic concepts of SVNSs. Section 3 defines a generalized
distance measure between SVNSs and proposes two distance-based similarity measures. In Section 4,
single-valued neutrosophic clustering methods are proposed based on the similarity measures of SVNSs as an
extension of intuitionistic fuzzy clustering algorithms. Section 5 gives an illustrative example and a
discussion of the clustering analyses. Conclusions and further research are contained in Section 6.



2 Basic Concepts of SVNSs 

The neutrosophic set is a part of neutrosophy and generalizes the fuzzy set, interval-valued fuzzy set, IFS, and
IVIFS from a philosophical point of view [4]. Smarandache [4] originally gave the definition of a neutrosophic
set.

Definition 1 ([4]). Let X be a space of points (objects), with a generic element in X denoted by x. A neutrosophic
set A in X is characterized by a truth-membership function $T_A(x)$, an indeterminacy-membership function
$I_A(x)$, and a falsity-membership function $F_A(x)$. The functions $T_A(x)$, $I_A(x)$, and $F_A(x)$ are real
standard or non-standard subsets of $]^-0, 1^+[$. That is, $T_A(x): X \to ]^-0, 1^+[$,
$I_A(x): X \to ]^-0, 1^+[$, and $F_A(x): X \to ]^-0, 1^+[$. Thus, there is no restriction on the sum of $T_A(x)$,
$I_A(x)$, and $F_A(x)$, so $^-0 \le \sup T_A(x) + \sup I_A(x) + \sup F_A(x) \le 3^+$.

Obviously, the neutrosophic set is difficult to apply in real scientific and engineering applications [6]. Hence,
Wang et al. [6] proposed the SVNS as a subclass of the neutrosophic set and introduced the definition of an SVNS.

Definition 2 ([6]). Let X be a space of points (objects) with generic elements in X denoted by x. An SVNS A in X
is characterized by a truth-membership function $T_A(x)$, an indeterminacy-membership function $I_A(x)$, and a
falsity-membership function $F_A(x)$. Then, an SVNS A can be denoted by

$$A = \{\langle x, T_A(x), I_A(x), F_A(x) \rangle \mid x \in X\},$$

where $T_A(x), I_A(x), F_A(x) \in [0, 1]$ for each point x in X. Therefore, the sum of $T_A(x)$, $I_A(x)$, and
$F_A(x)$ satisfies the condition $0 \le T_A(x) + I_A(x) + F_A(x) \le 3$.

Definition 3 ([6]). The complement of an SVNS A is denoted by $A^c$ and is defined as $T_{A^c}(x) = F_A(x)$,
$I_{A^c}(x) = 1 - I_A(x)$, $F_{A^c}(x) = T_A(x)$ for any x in X. Then

$$A^c = \{\langle x, F_A(x), 1 - I_A(x), T_A(x) \rangle \mid x \in X\}.$$

Definition 4 ([6]). An SVNS A is contained in another SVNS B, written $A \subseteq B$, if and only if
$T_A(x) \le T_B(x)$, $I_A(x) \ge I_B(x)$, and $F_A(x) \ge F_B(x)$ for any x in X.

Definition 5 ([6]). Two SVNSs A and B are equal, written as A = B, if and only if $A \subseteq B$ and
$B \subseteq A$.
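
As an illustration of Definitions 2-5, the following is a minimal Python sketch and not part of the original
formulation. The class name `SVNS` and its method names are hypothetical, and the representation (a dictionary
from element names to (T, I, F) triples) is an assumption made only for this example.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

Triple = Tuple[float, float, float]  # (T, I, F), each expected to lie in [0, 1]


@dataclass
class SVNS:
    """Hypothetical container: maps each element of X to its (T, I, F) triple."""
    values: Dict[str, Triple]

    def __post_init__(self) -> None:
        # Definition 2: every component must lie in [0, 1].
        for x, triple in self.values.items():
            if not all(0.0 <= v <= 1.0 for v in triple):
                raise ValueError(f"components of {x!r} must lie in [0, 1]")

    def complement(self) -> "SVNS":
        # Definition 3: swap T and F, replace I by 1 - I.
        return SVNS({x: (f, 1.0 - i, t) for x, (t, i, f) in self.values.items()})

    def is_contained_in(self, other: "SVNS") -> bool:
        # Definition 4: T_A <= T_B, I_A >= I_B, F_A >= F_B for every x.
        return all(
            t <= other.values[x][0]
            and i >= other.values[x][1]
            and f >= other.values[x][2]
            for x, (t, i, f) in self.values.items()
        )

    def equals(self, other: "SVNS") -> bool:
        # Definition 5: mutual containment.
        return self.is_contained_in(other) and other.is_contained_in(self)


a = SVNS({"x1": (0.1, 0.5, 0.6), "x2": (0.2, 0.5, 0.7)})
b = SVNS({"x1": (0.3, 0.4, 0.5), "x2": (0.5, 0.3, 0.4)})
print(a.is_contained_in(b))          # containment test of Definition 4
print(a.complement().values["x1"])   # complement triple of Definition 3
```

The sketch assumes both sets are defined over the same elements of X; it does not check this.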



3 Distance-Based Similarity Measures between SVNSs 

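For two SVNSs A and B in a universe of discourse $X = \{x_1, x_2, \ldots, x_n\}$, which are denoted by
$A = \{\langle x_i, T_A(x_i), I_A(x_i), F_A(x_i) \rangle \mid x_i \in X\}$ and
$B = \{\langle x_i, T_B(x_i), I_B(x_i), F_B(x_i) \rangle \mid x_i \in X\}$, where
$T_A(x_i), I_A(x_i), F_A(x_i), T_B(x_i), I_B(x_i), F_B(x_i) \in [0, 1]$ for every $x_i \in X$, let us consider the
weight $w_i$ (i = 1, 2, ..., n) of an element $x_i$ (i = 1, 2, ..., n), with $w_i \ge 0$ (i = 1, 2, ..., n) and
$\sum_{i=1}^{n} w_i = 1$. Then, we define the generalized single-valued neutrosophic weighted distance measure:

$$d_p(A, B) = \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^p + |I_A(x_i) - I_B(x_i)|^p + |F_A(x_i) - F_B(x_i)|^p \right] \right\}^{1/p}, \qquad (1)$$

where p > 0.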

As the Hamming distance and the Euclidean distance, which are two typical distance measures, are usually
used in practical applications [11], when p = 1, 2, we obtain the single-valued neutrosophic weighted
Hamming distance and the single-valued neutrosophic weighted Euclidean distance, respectively, as follows:

$$d_1(A, B) = \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)| + |I_A(x_i) - I_B(x_i)| + |F_A(x_i) - F_B(x_i)| \right], \qquad (2)$$

$$d_2(A, B) = \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^2 + |I_A(x_i) - I_B(x_i)|^2 + |F_A(x_i) - F_B(x_i)|^2 \right] \right\}^{1/2}. \qquad (3)$$

Therefore, Eqs. (2) and (3) are special cases of Eq. (1).

Then, for the distance measure, we have the following proposition.

Proposition 1. The above-defined distance $d_p(A, B)$ for p > 0 satisfies the following properties:

(DP1) $0 \le d_p(A, B) \le 1$;

(DP2) $d_p(A, B) = 0$ if and only if A = B;

(DP3) $d_p(A, B) = d_p(B, A)$;

(DP4) If $A \subseteq B \subseteq C$, where C is an SVNS in X, then $d_p(A, C) \ge d_p(A, B)$ and
$d_p(A, C) \ge d_p(B, C)$.

Proof. It is easy to see that $d_p(A, B)$ satisfies the properties (DP1)-(DP3). Therefore, we only prove (DP4).
Let $A \subseteq B \subseteq C$; then $T_A(x_i) \le T_B(x_i) \le T_C(x_i)$, $I_A(x_i) \ge I_B(x_i) \ge I_C(x_i)$,
and $F_A(x_i) \ge F_B(x_i) \ge F_C(x_i)$ for every $x_i \in X$. Then, we obtain the following relations:

$$|T_A(x_i) - T_B(x_i)|^p \le |T_A(x_i) - T_C(x_i)|^p, \quad |T_B(x_i) - T_C(x_i)|^p \le |T_A(x_i) - T_C(x_i)|^p,$$

$$|I_A(x_i) - I_B(x_i)|^p \le |I_A(x_i) - I_C(x_i)|^p, \quad |I_B(x_i) - I_C(x_i)|^p \le |I_A(x_i) - I_C(x_i)|^p,$$

$$|F_A(x_i) - F_B(x_i)|^p \le |F_A(x_i) - F_C(x_i)|^p, \quad |F_B(x_i) - F_C(x_i)|^p \le |F_A(x_i) - F_C(x_i)|^p.$$

Hence,

$$|T_A(x_i) - T_B(x_i)|^p + |I_A(x_i) - I_B(x_i)|^p + |F_A(x_i) - F_B(x_i)|^p \le |T_A(x_i) - T_C(x_i)|^p + |I_A(x_i) - I_C(x_i)|^p + |F_A(x_i) - F_C(x_i)|^p,$$

$$|T_B(x_i) - T_C(x_i)|^p + |I_B(x_i) - I_C(x_i)|^p + |F_B(x_i) - F_C(x_i)|^p \le |T_A(x_i) - T_C(x_i)|^p + |I_A(x_i) - I_C(x_i)|^p + |F_A(x_i) - F_C(x_i)|^p.$$

Combining the above inequalities with the distance formula (1), we obtain
$d_p(A, B) \le d_p(A, C)$ and $d_p(B, C) \le d_p(A, C)$ for p > 0.

Thus, the property (DP4) is satisfied.

This completes the proof. □

Note that similarity and distance (dissimilarity) measures are complementary: when the first increases,
the second decreases. The normalized distance measure and the similarity measure are dual concepts. Thus,
$S(A, B) = 1 - d(A, B)$ and vice versa. The properties of the similarity measures given below are therefore
complementary to those of the distance measures above.

Proposition 2. Let A and B be two SVNSs in a universe of discourse $X = \{x_1, x_2, \ldots, x_n\}$; $S(A, B)$ is
called a single-valued neutrosophic similarity measure, which should satisfy the following properties:

(SP1) $0 \le S(A, B) \le 1$;

(SP2) $S(A, B) = 1$ if and only if A = B;

(SP3) $S(A, B) = S(B, A)$;

(SP4) $S(A, C) \le S(A, B)$ and $S(A, C) \le S(B, C)$ if $A \subseteq B \subseteq C$ for an SVNS C.

Assume that there are two SVNSs $A = \{\langle x_i, T_A(x_i), I_A(x_i), F_A(x_i) \rangle \mid x_i \in X\}$ and
$B = \{\langle x_i, T_B(x_i), I_B(x_i), F_B(x_i) \rangle \mid x_i \in X\}$ in a universe of discourse
$X = \{x_1, x_2, \ldots, x_n\}$. Thus, according to the relationship between the distance and the
similarity measure, we can obtain the following single-valued neutrosophic similarity measure:

$$S_1(A, B) = 1 - d_p(A, B) = 1 - \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^p + |I_A(x_i) - I_B(x_i)|^p + |F_A(x_i) - F_B(x_i)|^p \right] \right\}^{1/p}. \qquad (4)$$




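Obviously, $S_1(A, B)$ satisfies the properties (SP1)-(SP4) in Proposition 2; this follows from the relationship
between the distance and the similarity measure and from the proof of Proposition 1, so the proof is omitted here.

Furthermore, we can also propose another single-valued neutrosophic similarity measure:

$$S_2(A, B) = \frac{1 - d_p(A, B)}{1 + d_p(A, B)} = \frac{1 - \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^p + |I_A(x_i) - I_B(x_i)|^p + |F_A(x_i) - F_B(x_i)|^p \right] \right\}^{1/p}}{1 + \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^p + |I_A(x_i) - I_B(x_i)|^p + |F_A(x_i) - F_B(x_i)|^p \right] \right\}^{1/p}}. \qquad (5)$$

Then, the similarity measure $S_2(A, B)$ also satisfies the properties (SP1)-(SP4) in Proposition 2.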

Proof. It is easy to see that $S_2(A, B)$ satisfies the properties (SP1)-(SP3). Therefore, we only prove the
property (SP4).

As we obtain $d_p(A, B) \le d_p(A, C)$ and $d_p(B, C) \le d_p(A, C)$ for p > 0 from the property (DP4) in
Proposition 1, there are $1 - d_p(A, B) \ge 1 - d_p(A, C)$, $1 - d_p(B, C) \ge 1 - d_p(A, C)$,
$1 + d_p(A, B) \le 1 + d_p(A, C)$, and $1 + d_p(B, C) \le 1 + d_p(A, C)$. Then, there are the following
inequalities:

$$\frac{1 - d_p(A, B)}{1 + d_p(A, B)} \ge \frac{1 - d_p(A, C)}{1 + d_p(A, C)}, \qquad \frac{1 - d_p(B, C)}{1 + d_p(B, C)} \ge \frac{1 - d_p(A, C)}{1 + d_p(A, C)}.$$

Then, there are $S_2(A, C) \le S_2(A, B)$ and $S_2(A, C) \le S_2(B, C)$. Hence, the property (SP4) is satisfied.

This completes the proof. □



Example 1. Assume that we have the following three SVNSs in a universe of discourse $X = \{x_1, x_2\}$:

$A = \{\langle x_1, 0.1, 0.5, 0.6 \rangle, \langle x_2, 0.2, 0.5, 0.7 \rangle\}$,

$B = \{\langle x_1, 0.3, 0.4, 0.5 \rangle, \langle x_2, 0.5, 0.3, 0.4 \rangle\}$,

$C = \{\langle x_1, 0.6, 0.1, 0.2 \rangle, \langle x_2, 0.8, 0.1, 0.3 \rangle\}$.

These sets satisfy $A \subseteq B \subseteq C$ for each $x_i$ in $X = \{x_1, x_2\}$, and the weight vector is
$w = (0.5, 0.5)^T$.

By applying Eq. (4) with p = 1, the similarity measures between the SVNSs are as follows:
$S_1(A, B) = 0.8$, $S_1(B, C) = 0.75$, and $S_1(A, C) = 0.55$.

Thus, $S_1(A, C) < S_1(A, B)$ and $S_1(A, C) < S_1(B, C)$.

When p = 2, the similarity measures between the SVNSs are as follows:

$S_1(A, B) = 0.784$, $S_1(B, C) = 0.7386$, and $S_1(A, C) = 0.5436$.

Hence, $S_1(A, C) < S_1(A, B)$ and $S_1(A, C) < S_1(B, C)$.

By applying Eq. (5) with p = 1, the similarity measures between the SVNSs are as follows:
$S_2(A, B) = 0.6667$, $S_2(B, C) = 0.6$, and $S_2(A, C) = 0.3793$.

Thus, $S_2(A, C) < S_2(A, B)$ and $S_2(A, C) < S_2(B, C)$.

When p = 2, the similarity measures between the SVNSs are as follows:

$S_2(A, B) = 0.6447$, $S_2(B, C) = 0.5855$, and $S_2(A, C) = 0.3732$.

Hence, $S_2(A, C) < S_2(A, B)$ and $S_2(A, C) < S_2(B, C)$.
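
The computations of Example 1 can be checked with a short script. The following Python sketch is an
illustration only: the function names `svns_distance`, `s1`, and `s2` are hypothetical, and each SVNS is assumed
to be stored as a list of (T, I, F) triples in the order $x_1, x_2, \ldots$.

```python
from typing import Sequence, Tuple

Triple = Tuple[float, float, float]  # (T, I, F) for one element x_i


def svns_distance(a: Sequence[Triple], b: Sequence[Triple],
                  w: Sequence[float], p: float = 2.0) -> float:
    """Generalized weighted distance of Eq. (1); requires p > 0."""
    if p <= 0:
        raise ValueError("p must be positive")
    total = sum(
        wi * sum(abs(u - v) ** p for u, v in zip(ai, bi))
        for wi, ai, bi in zip(w, a, b)
    )
    return (total / 3.0) ** (1.0 / p)


def s1(a, b, w, p=2.0):
    """Similarity measure of Eq. (4): 1 - d_p."""
    return 1.0 - svns_distance(a, b, w, p)


def s2(a, b, w, p=2.0):
    """Similarity measure of Eq. (5): (1 - d_p) / (1 + d_p)."""
    d = svns_distance(a, b, w, p)
    return (1.0 - d) / (1.0 + d)


# Data of Example 1.
A = [(0.1, 0.5, 0.6), (0.2, 0.5, 0.7)]
B = [(0.3, 0.4, 0.5), (0.5, 0.3, 0.4)]
C = [(0.6, 0.1, 0.2), (0.8, 0.1, 0.3)]
w = [0.5, 0.5]

for p in (1, 2):
    for name, measure in (("S1", s1), ("S2", s2)):
        values = [round(measure(x, y, w, p), 4) for x, y in ((A, B), (B, C), (A, C))]
        print(f"p={p} {name}(A,B), {name}(B,C), {name}(A,C) = {values}")
```

Rounded to four decimals, the printed values should agree with those reported in Example 1.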



4 Clustering Algorithm Based on the Similarity Measures of SVNSs 



In this section, we apply the proposed similarity measures of SVNSs to clustering analysis under a
single-valued neutrosophic environment.

On the basis of the intuitionistic fuzzy clustering algorithm proposed by Zhang et al. [13] and Xu et al. [7],
we first introduce the following definitions.

Definition 6. Assume that $A = \{A_1, A_2, \ldots, A_m\}$ is a set of SVNSs and $C = (s_{ij})_{m \times m}$ is a
similarity matrix, where $s_{ij} = S_k(A_i, A_j)$ (k = 1, 2) and $s_{ij} \in [0, 1]$ for i, j = 1, 2, ..., m,
with $s_{ii} = 1$ for i = 1, 2, ..., m and $s_{ij} = s_{ji}$ for i, j = 1, 2, ..., m.

Definition 7 ([7, 13]). Let $C = (s_{ij})_{m \times m}$ be a similarity matrix. If
$C^2 = C \circ C = (\bar{s}_{ij})_{m \times m}$, then $C^2$ is called a composition matrix of C, where
$\bar{s}_{ij} = \max_k \{\min(s_{ik}, s_{kj})\}$ for i, j = 1, 2, ..., m.

Definition 8 ([7, 13]). Let $C = (s_{ij})_{m \times m}$ be a similarity matrix. If $C^2 \subseteq C$, i.e.,
$\bar{s}_{ij} \le s_{ij}$ for i, j = 1, 2, ..., m, then C is called an equivalent similarity matrix.

Definition 9 ([7, 13]). Let $C = (s_{ij})_{m \times m}$ be a similarity matrix. Then, after a finite number of
compositions of C:

$$C \to C^2 \to C^4 \to \cdots \to C^{2^k} \to \cdots, \qquad (6)$$

there must exist a positive integer k such that $C^{2^k} = C^{2^{(k+1)}}$, and then $C^{2^k}$ is also an
equivalent similarity matrix.

Definition 10 ([7, 13]). Let $C = (s_{ij})_{m \times m}$ be an equivalent similarity matrix. Then,
$C_\lambda = (s_{ij}^{\lambda})_{m \times m}$ is called the λ-cutting matrix of C, where

$$s_{ij}^{\lambda} = \begin{cases} 0, & s_{ij} < \lambda; \\ 1, & s_{ij} \ge \lambda, \end{cases} \qquad i, j = 1, 2, \ldots, m, \qquad (7)$$

and λ is the confidence level with $\lambda \in [0, 1]$.

Assume that $A = \{A_1, A_2, \ldots, A_m\}$ is a set of SVNSs, where
$A_j = \{\langle x_i, T_{A_j}(x_i), I_{A_j}(x_i), F_{A_j}(x_i) \rangle \mid x_i \in X\}$ (j = 1, 2, ..., m) in a
universe of discourse $X = \{x_1, x_2, \ldots, x_n\}$ is an SVNS. Let $w_i$ be the weight of each element $x_i$
(i = 1, 2, ..., n), with $w_i \in [0, 1]$ and $\sum_{i=1}^{n} w_i = 1$. Then, the algorithm for clustering SVNSs
is as follows:

Step 1. Use Eq. (4) or (5) to calculate the similarity degrees between the SVNSs, and then construct
a similarity matrix $C = (s_{ij})_{m \times m}$, where $s_{ij} = S_k(A_i, A_j)$ (k = 1, 2) for i, j = 1, 2, ..., m.

Step 2. Repeat the process of building the composition matrices until it holds that

$$C \to C^2 \to C^4 \to \cdots \to C^{2^k} = C^{2^{(k+1)}},$$

which implies that $C^{2^k}$ is an equivalent similarity matrix, denoted by
$\bar{C} = (\bar{s}_{ij})_{m \times m}$.

Step 3. For the equivalent similarity matrix $\bar{C} = (\bar{s}_{ij})_{m \times m}$, construct a λ-cutting
matrix $\bar{C}_\lambda = (\bar{s}_{ij}^{\lambda})_{m \times m}$ of $\bar{C}$ by Eq. (7); if all the elements of
the ith row (column) in $\bar{C}_\lambda$ are the same as the corresponding elements of the jth row (column),
the SVNSs $A_i$ and $A_j$ are regarded as belonging to the same class.
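
Steps 2 and 3 can be sketched in a few lines of Python. This is a minimal illustration under stated
assumptions, not a reference implementation: the function names are hypothetical, the similarity matrix is
assumed to be already computed (Step 1) and stored as a list of rows, and the 4 x 4 matrix below contains
made-up values that do not come from the article.

```python
from typing import List

Matrix = List[List[float]]


def compose(c: Matrix) -> Matrix:
    """Max-min composition C o C of Definition 7."""
    m = len(c)
    return [[max(min(c[i][k], c[k][j]) for k in range(m)) for j in range(m)]
            for i in range(m)]


def equivalent_matrix(c: Matrix) -> Matrix:
    """Square repeatedly (C -> C^2 -> C^4 -> ...) until the matrix stops changing."""
    while True:
        nxt = compose(c)
        if nxt == c:  # exact comparison is safe: max/min only select existing entries
            return c
        c = nxt


def lambda_cut(c: Matrix, lam: float) -> List[List[int]]:
    """Lambda-cutting matrix of Eq. (7): 1 where s_ij >= lambda, else 0."""
    return [[1 if s >= lam else 0 for s in row] for row in c]


def clusters(cut: List[List[int]]) -> List[List[int]]:
    """Group the indices whose rows in the cut matrix are identical (Step 3)."""
    groups = {}
    for i, row in enumerate(cut):
        groups.setdefault(tuple(row), []).append(i)
    return list(groups.values())


# Toy similarity matrix (made-up values, used only to exercise the functions).
C = [[1.0, 0.8, 0.4, 0.5],
     [0.8, 1.0, 0.6, 0.3],
     [0.4, 0.6, 1.0, 0.7],
     [0.5, 0.3, 0.7, 1.0]]

C_bar = equivalent_matrix(C)
for lam in (0.75, 0.65, 0.6):
    print(lam, clusters(lambda_cut(C_bar, lam)))
```

Indices in the output are 0-based, so index 0 corresponds to $A_1$. Lowering λ merges classes and never splits
them, which is why the partitions obtained for decreasing λ are nested.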



5 Illustrative Example and Discussion 



In this section, a real example adapted from Zhang et al. [13] is employed to demonstrate the application and
effectiveness of the proposed clustering methods under a single-valued neutrosophic data environment.

A car market is going to classify five different cars $A_j$ (j = 1, 2, ..., 5). Every car has six evaluation
factors (attributes): (i) $x_1$, fuel consumption; (ii) $x_2$, coefficient of friction; (iii) $x_3$, price;
(iv) $x_4$, degree of comfort; (v) $x_5$, design; (vi) $x_6$, security coefficient. The characteristics of each
car under the six attributes are represented in the form of SVNSs, and the single-valued neutrosophic data are
as follows:



$A_1 = \{\langle x_1, 0.3, 0.2, 0.5 \rangle, \langle x_2, 0.6, 0.3, 0.1 \rangle, \langle x_3, 0.4, 0.3, 0.3 \rangle, \langle x_4, 0.8, 0.1, 0.1 \rangle, \langle x_5, 0.1, 0.3, 0.6 \rangle, \langle x_6, 0.5, 0.2, 0.4 \rangle\}$,

$A_2 = \{\langle x_1, 0.6, 0.3, 0.3 \rangle, \langle x_2, 0.5, 0.4, 0.2 \rangle, \langle x_3, 0.6, 0.2, 0.1 \rangle, \langle x_4, 0.7, 0.2, 0.1 \rangle, \langle x_5, 0.3, 0.1, 0.6 \rangle, \langle x_6, 0.4, 0.3, 0.3 \rangle\}$,

$A_3 = \{\langle x_1, 0.4, 0.2, 0.4 \rangle, \langle x_2, 0.8, 0.2, 0.1 \rangle, \langle x_3, 0.5, 0.3, 0.1 \rangle, \langle x_4, 0.6, 0.1, 0.2 \rangle, \langle x_5, 0.4, 0.1, 0.5 \rangle, \langle x_6, 0.3, 0.2, 0.2 \rangle\}$,

$A_4 = \{\langle x_1, 0.2, 0.4, 0.4 \rangle, \langle x_2, 0.4, 0.5, 0.1 \rangle, \langle x_3, 0.9, 0.2, 0.0 \rangle, \langle x_4, 0.8, 0.2, 0.1 \rangle, \langle x_5, 0.2, 0.3, 0.5 \rangle, \langle x_6, 0.7, 0.3, 0.1 \rangle\}$,

$A_5 = \{\langle x_1, 0.5, 0.3, 0.2 \rangle, \langle x_2, 0.3, 0.2, 0.6 \rangle, \langle x_3, 0.6, 0.1, 0.3 \rangle, \langle x_4, 0.7, 0.1, 0.1 \rangle, \langle x_5, 0.6, 0.2, 0.2 \rangle, \langle x_6, 0.5, 0.2, 0.3 \rangle\}$.

If the weight vector of the attributes $x_i$ (i = 1, 2, ..., 6) is $w = (1/6, 1/6, 1/6, 1/6, 1/6, 1/6)^T$, then
we utilize the two single-valued neutrosophic similarity measures to classify the five different cars $A_j$
(j = 1, 2, ..., 5) by the single-valued neutrosophic clustering algorithm.



5.1 Clustering Analysis Using Eq. (4) 



Step 1. Utilize the similarity measure formula (4) (take p = 2) to calculate the similarity measures between
each pair of SVNSs $A_i$ and $A_j$ (i, j = 1, 2, 3, 4, 5) and construct the following similarity matrix:

$$C = \begin{pmatrix}
1 & 0.8528 & 0.8528 & 0.8085 & 0.7631 \\
0.8528 & 1 & 0.8709 & 0.8317 & 0.8174 \\
0.8528 & 0.8709 & 1 & 0.7853 & 0.7814 \\
0.8085 & 0.8317 & 0.7853 & 1 & 0.7585 \\
0.7631 & 0.8174 & 0.7814 & 0.7585 & 1
\end{pmatrix}.$$

Step 2. Obtain equivalent similarity matrices by a finite number of compositions of C:

$$C^2 = C \circ C = \begin{pmatrix}
1 & 0.8528 & 0.8528 & 0.8317 & 0.8174 \\
0.8528 & 1 & 0.8709 & 0.8317 & 0.8174 \\
0.8528 & 0.8709 & 1 & 0.8317 & 0.8174 \\
0.8317 & 0.8317 & 0.8317 & 1 & 0.8174 \\
0.8174 & 0.8174 & 0.8174 & 0.8174 & 1
\end{pmatrix},$$

$$C^4 = C^2 \circ C^2 = \begin{pmatrix}
1 & 0.8528 & 0.8528 & 0.8317 & 0.8174 \\
0.8528 & 1 & 0.8709 & 0.8317 & 0.8174 \\
0.8528 & 0.8709 & 1 & 0.8317 & 0.8174 \\
0.8317 & 0.8317 & 0.8317 & 1 & 0.8174 \\
0.8174 & 0.8174 & 0.8174 & 0.8174 & 1
\end{pmatrix}.$$

Obviously, $C^4 = C^2$ implies that $C^2$ is an equivalent similarity matrix, denoted by $\bar{C}$.



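Step 3. When λ takes different values, we can construct a λ-cutting matrix
$\bar{C}_\lambda = (\bar{s}_{ij}^{\lambda})_{m \times m}$ of the equivalent similarity matrix $\bar{C}$ by
Eq. (7) and obtain different categories, which are discussed as follows:

(i) If $0 \le \lambda \le 0.8174$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1
\end{pmatrix},$$

then the cars belong to the same category: $\{A_1, A_2, A_3, A_4, A_5\}$.

(ii) If $0.8174 < \lambda \le 0.8317$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 1 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into two categories: $\{A_1, A_2, A_3, A_4\}$, $\{A_5\}$.

(iii) If $0.8317 < \lambda \le 0.8528$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 1 & 1 & 0 & 0 \\
1 & 1 & 1 & 0 & 0 \\
1 & 1 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into three categories: $\{A_1, A_2, A_3\}$, $\{A_4\}$, $\{A_5\}$.

(iv) If $0.8528 < \lambda \le 0.8709$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into four categories: $\{A_1\}$, $\{A_2, A_3\}$, $\{A_4\}$, $\{A_5\}$.

(v) If $0.8709 < \lambda \le 1$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into five categories: $\{A_1\}$, $\{A_2\}$, $\{A_3\}$, $\{A_4\}$, $\{A_5\}$.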
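
The threshold values in cases (i)-(v) can be cross-checked from a different angle. The sketch below is not the
algorithm of Section 4; it swaps in a single-linkage merge with a union-find structure, relying on the standard
fact that the λ-cuts of a max-min transitive closure coincide with the groups obtained by joining pairs in order
of decreasing similarity. The function name `merge_levels` is hypothetical, and the matrix is the similarity
matrix C of Step 1 copied from the text.

```python
from typing import Dict, List, Tuple

# Similarity matrix C of Section 5.1 (Eq. (4), p = 2), copied from the text.
C = [
    [1.0,    0.8528, 0.8528, 0.8085, 0.7631],
    [0.8528, 1.0,    0.8709, 0.8317, 0.8174],
    [0.8528, 0.8709, 1.0,    0.7853, 0.7814],
    [0.8085, 0.8317, 0.7853, 1.0,    0.7585],
    [0.7631, 0.8174, 0.7814, 0.7585, 1.0],
]


def merge_levels(c: List[List[float]]) -> List[Tuple[float, List[List[int]]]]:
    """Join pairs in order of decreasing similarity and record the partition
    each time two different groups are merged (single-linkage view)."""
    m = len(c)
    parent = list(range(m))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    pairs = sorted(((c[i][j], i, j) for i in range(m) for j in range(i + 1, m)),
                   reverse=True)
    history = []
    for s, i, j in pairs:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[max(ri, rj)] = min(ri, rj)
            groups: Dict[int, List[int]] = {}
            for k in range(m):
                groups.setdefault(find(k), []).append(k)
            history.append((s, sorted(groups.values())))
    return history


for level, partition in merge_levels(C):
    # Indices are 0-based: index 0 stands for A_1, index 4 for A_5.
    print(level, partition)
```

Each recorded level is the largest λ for which the corresponding partition is obtained, so the four merge levels
should match the interval boundaries 0.8709, 0.8528, 0.8317, and 0.8174 listed in Step 3.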



5.2 Clustering Analysis Using Eq. (5) 



Step 1. Utilize the similarity measure formula (5) (take p = 2) to calculate the similarity measures between
each pair of SVNSs $A_i$ and $A_j$ (i, j = 1, 2, 3, 4, 5) and construct the following similarity matrix:

$$C = \begin{pmatrix}
1 & 0.7434 & 0.7434 & 0.6786 & 0.6170 \\
0.7434 & 1 & 0.7713 & 0.7119 & 0.6912 \\
0.7434 & 0.7713 & 1 & 0.6464 & 0.6413 \\
0.6786 & 0.7119 & 0.6464 & 1 & 0.6109 \\
0.6170 & 0.6912 & 0.6413 & 0.6109 & 1
\end{pmatrix}.$$

Step 2. Obtain equivalent similarity matrices by a finite number of compositions of C:

$$C^2 = C \circ C = \begin{pmatrix}
1 & 0.7434 & 0.7434 & 0.7119 & 0.6912 \\
0.7434 & 1 & 0.7713 & 0.7119 & 0.6912 \\
0.7434 & 0.7713 & 1 & 0.7119 & 0.6912 \\
0.7119 & 0.7119 & 0.7119 & 1 & 0.6912 \\
0.6912 & 0.6912 & 0.6912 & 0.6912 & 1
\end{pmatrix},$$

$$C^4 = C^2 \circ C^2 = \begin{pmatrix}
1 & 0.7434 & 0.7434 & 0.7119 & 0.6912 \\
0.7434 & 1 & 0.7713 & 0.7119 & 0.6912 \\
0.7434 & 0.7713 & 1 & 0.7119 & 0.6912 \\
0.7119 & 0.7119 & 0.7119 & 1 & 0.6912 \\
0.6912 & 0.6912 & 0.6912 & 0.6912 & 1
\end{pmatrix}.$$

Obviously, $C^4 = C^2$ implies that $C^2$ is an equivalent similarity matrix, denoted by $\bar{C}$.



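Step 3. When λ takes different values, we can construct a λ-cutting matrix
$\bar{C}_\lambda = (\bar{s}_{ij}^{\lambda})_{m \times m}$ of the equivalent similarity matrix $\bar{C}$ by
Eq. (7) and obtain different categories, which are discussed as follows:

(i) If $0 \le \lambda \le 0.6912$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & 1
\end{pmatrix},$$

then the cars belong to the same category: $\{A_1, A_2, A_3, A_4, A_5\}$.

(ii) If $0.6912 < \lambda \le 0.7119$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 1 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into two categories: $\{A_1, A_2, A_3, A_4\}$, $\{A_5\}$.

(iii) If $0.7119 < \lambda \le 0.7434$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 1 & 1 & 0 & 0 \\
1 & 1 & 1 & 0 & 0 \\
1 & 1 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into three categories: $\{A_1, A_2, A_3\}$, $\{A_4\}$, $\{A_5\}$.

(iv) If $0.7434 < \lambda \le 0.7713$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into four categories: $\{A_1\}$, $\{A_2, A_3\}$, $\{A_4\}$, $\{A_5\}$.

(v) If $0.7713 < \lambda \le 1$,

$$\bar{C}_\lambda = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix},$$

then the cars can be divided into five categories: $\{A_1\}$, $\{A_2\}$, $\{A_3\}$, $\{A_4\}$, $\{A_5\}$.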



5.3 Discussion 

From the above clustering results, we see that the two similarity measures can both be applied to clustering
SVNSs and that they produce the same clustering results. The method in [13], which is based on the similarity
measure of IFSs, obtained three clustering situations, whereas the clustering algorithm based on the proposed
similarity measures of SVNSs obtains five, namely cases (i)-(v). Hence, the clustering algorithms based on the
two similarity measures of SVNSs give a finer classification in this example, which indicates better
accuracy in clustering problems.
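
One way to see why the two measures lead to the same categories is the following short derivation, added here
as a sketch that uses only Eqs. (4) and (5) with a common value of p. Substituting $d_p = 1 - S_1$ into Eq. (5)
gives

```latex
S_2 = \frac{1 - d_p}{1 + d_p} = \frac{S_1}{2 - S_1},
\qquad
\frac{\mathrm{d} S_2}{\mathrm{d} S_1} = \frac{2}{(2 - S_1)^2} > 0
\quad \text{for } S_1 \in [0, 1].
```

Thus $S_2$ is a strictly increasing function of $S_1$. The max-min composition and the λ-cut only compare
matrix entries with one another and with the threshold, so an order-preserving transformation of the entries
leaves the nested categories unchanged and merely maps each threshold λ to $\lambda / (2 - \lambda)$. For
instance, $0.8528 / (2 - 0.8528) \approx 0.7434$, which matches the corresponding entries of the two similarity
matrices in Sections 5.1 and 5.2.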

As mentioned above, single-valued neutrosophic information is a generalization of intuitionistic
fuzzy information, and intuitionistic fuzzy information is in turn a generalization of fuzzy information. On
the one hand, an SVNS is an instance of a neutrosophic set, which gives us an additional possibility to
represent the uncertain, imprecise, incomplete, and inconsistent information that exists in the real world. It
can describe and handle indeterminate information and inconsistent information. In contrast, the connector in
the fuzzy set is defined with respect to T, i.e., membership only; hence, the information of indeterminacy
and non-membership is lost. The connectors in the IFS are defined with respect to T and F, i.e., membership
and non-membership only; hence, the indeterminacy is what is left from 1, and the IFS can only handle
incomplete information but not indeterminate and inconsistent information. In SVNSs, the truth-membership,
indeterminacy-membership, and falsity-membership are represented independently, and the connectors can be
defined with respect to any of them (no restriction). Thus, the notion of SVNSs is more general. On the other
hand, clustering analysis under a single-valued neutrosophic environment is suitable for capturing imprecise,
uncertain, and inconsistent information in clustering the data. Thus, the clustering algorithm based on the
similarity measures of SVNSs can cluster not only single-valued neutrosophic information but also
intuitionistic fuzzy information and fuzzy information. The proposed single-valued neutrosophic clustering
algorithm is therefore an extension of both the fuzzy clustering algorithm and the intuitionistic fuzzy
clustering algorithm, and it is more general than either of them. Furthermore, in situations that are
represented by indeterminate information and inconsistent information, the single-valued neutrosophic
clustering algorithm can demonstrate its superiority in clustering such single-valued neutrosophic data.



6 Conclusion 

This article introduced a generalized single-valued neutrosophic weighted distance measure and presented
two distance-based similarity measures in a single-valued neutrosophic setting. Then, a single-valued
neutrosophic clustering algorithm was established on the basis of the two similarity measures. Finally, an
illustrative example was given to demonstrate the application and effectiveness of the single-valued
neutrosophic clustering methods. The clustering results have shown that the single-valued neutrosophic
clustering algorithm is more general than the intuitionistic fuzzy clustering algorithm and the fuzzy
clustering algorithm. Furthermore, in situations that are represented by indeterminate information and
inconsistent information, the single-valued neutrosophic clustering algorithm can demonstrate its great
superiority in clustering such single-valued neutrosophic data, as SVNSs are a powerful tool to deal with
uncertain, imprecise, incomplete, and inconsistent information. In the future, the developed clustering
algorithm will be extended to clustering problems of interval-valued neutrosophic sets and further applied to
many areas such as information retrieval, investment decision making, and data mining.

Received November 15, 2013; previously published online March 7, 2014. 



Bibliography 

[1] K. Atanassov, Intuitionistic fuzzy sets, Fuzzy Sets Syst. 20 (1986), 87-96. 

[2] K. Atanassov and G. Gargov, Interval valued intuitionistic fuzzy sets, Fuzzy Sets Syst. 31 (1989), 343-349. 

[3] E. H. Ruspini, A new approach to clustering, Inform. Control 15 (1969), 22-32.

[4] F. Smarandache, A unifying field in logics. Neutrosophy: neutrosophic probability, set and logic, American
Research Press, Rehoboth, 1999.

[5] H. Wang, F. Smarandache, Y. Q. Zhang and R. Sunderraman, Interval neutrosophic sets and logic: theory and
applications in computing, Hexis, Phoenix, AZ, 2005.

[6] H. Wang, F. Smarandache, Y. Q. Zhang and R. Sunderraman, Single valued neutrosophic sets, Multispace
Multistruc. 4 (2010), 410-413.

[7] Z. S. Xu, J. Chen and J. J. Wu, Clustering algorithm for intuitionistic fuzzy sets, Inform. Sci. 19 (2008), 3775-3790.

[8] J. Ye, Multicriteria decision-making method using the correlation coefficient under single-valued
neutrosophic environment, Int. J. General Syst. 42 (2013), 386-394.

[9] J. Ye, Single valued neutrosophic cross-entropy for multicriteria decision making problems, Appl. Math.
Model. 38 (2014), 1170-1175.

[10] J. Ye, A multicriteria decision-making method using aggregation operators for simplified neutrosophic sets,
J. Intell. Fuzzy Syst. (2013), doi: 10.3233/IFS-130916.

[11] J. Ye, Similarity measures between interval neutrosophic sets and their applications in multicriteria
decision-making, J. Intell. Fuzzy Syst. 26 (2014), 165-172.

[12] L. A. Zadeh, Fuzzy sets, Inform. Control 8 (1965), 338-353.

[13] H. M. Zhang, Z. S. Xu and Q. Chen, Clustering method of intuitionistic fuzzy sets, Control Decision 22 (2007), 882-888. 



J. Ye: Clustering Based on Similarity Measures of SVNSs (De Gruyter)
Authenticated | yehjun@aliyun.com author's copy
Download Date | 10/16/14 6:43 AM



